Add tests for custom operator implementation correctness #457

Kacper-Pietkun · 2025-10-23T12:53:59Z

I added tests for custom ops defined in vllm_gaudi/ops:

For the tests of ops that are not using cuda kernels - native ops and hpu ops are triggered for the same input and their outputs are compared
For others tests that are using cuda kernels (so cannot be called with vllm-gaudi plugin) I created separate directory to store some predefined small tensors - weights, inputs and outputs. These tensors are too big to hardcode them in tests, however their sizes were adjusted, so all of them weight less than 3MB in total. Tensors are stored in a .safetensors format. Such tests run hpu ops with loaded inputs and weights and compare their outputs with the loaded outputs.

Signed-off-by: Kacper Pietkun <[email protected]>

Kacper-Pietkun · 2025-10-23T12:54:20Z

/run-gaudi-tests

Signed-off-by: Kacper Pietkun <[email protected]>

Kacper-Pietkun · 2025-10-23T12:55:49Z

/run-gaudi-tests

github-actions · 2025-10-23T14:12:09Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
7e0941055fdf89bae93045683dd80542177f3241

Kacper-Pietkun · 2025-10-27T07:28:27Z

/run-gaudi-tests

github-actions · 2025-10-27T08:18:19Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
63b22e0dbb901b75619aa4bca2dfa1d7a71f439e

Signed-off-by: Kacper Pietkun <[email protected]>

github-actions · 2025-10-27T13:23:18Z

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

Kacper-Pietkun · 2025-10-27T13:24:56Z

/run-gaudi-tests

github-actions · 2025-10-27T14:16:33Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
b368382964913312d41c670b4166f4c83eed49aa

github-actions · 2025-11-04T10:23:31Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
0384aa7150c4c9778efca041ffd1beb3ad2bd694

Copilot

Pull Request Overview

This PR adds comprehensive unit tests for custom operators implemented in vllm_gaudi/ops. The tests verify correctness by comparing outputs between native VLLM operators and HPU-specific implementations. For operators using CUDA kernels, pre-computed reference tensors stored in safetensors format are used for validation.

Key changes:

Native and HPU operator outputs are compared for operators compatible with both implementations
Pre-computed reference data in safetensors format is used for CUDA kernel-based operators
Test utilities added for temporary operator registry management and test data access

Reviewed Changes

Copilot reviewed 11 out of 19 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
vllm_gaudi/ops/hpu_fp8.py	Removed unused imports and decorator
tests/unit_tests/test_bucketing.py	Added config clearing before setup
tests/unit_tests/ops/utils.py	Added test utilities for operator registration and data loading
tests/unit_tests/ops/test_hpu_rotary_embedding.py	Tests for rotary embedding operator variants
tests/unit_tests/ops/test_hpu_multihead_attn.py	Tests for multi-head attention operator
tests/unit_tests/ops/test_hpu_layernorm.py	Tests for RMS normalization operator
tests/unit_tests/ops/test_hpu_gptq.py	Tests for GPTQ quantization operator
tests/unit_tests/ops/test_hpu_fused_moe.py	Tests for fused MoE operator
tests/unit_tests/ops/test_hpu_fp8.py	Tests for FP8 quantization operators
tests/unit_tests/ops/test_hpu_compressed_tensors.py	Tests for compressed tensor operators
tests/unit_tests/ops/test_hpu_awq.py	Tests for AWQ quantization operator

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/unit_tests/ops/utils.py

tests/unit_tests/ops/test_hpu_rotary_embedding.py

Co-authored-by: Copilot <[email protected]> Signed-off-by: Kacper Pietkun <[email protected]>

Kacper-Pietkun · 2025-11-07T09:43:27Z

All of the above changes are just corrections of spelling mistakes detected by copilot

github-actions · 2025-11-07T10:34:23Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
0384aa7150c4c9778efca041ffd1beb3ad2bd694

Add custom op correctness tests

3422a61

Signed-off-by: Kacper Pietkun <[email protected]>

Kacper-Pietkun requested review from adobrzyn, afierka-intel, iboiko-habana, kzawora-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, vivekgoe and xuechendi as code owners October 23, 2025 12:54

fix bucketing

cd0917a

Signed-off-by: Kacper Pietkun <[email protected]>

Merge branch 'main' into dev/kpietkun/test_custom_op_correctness_clean

35d8193

store data as safetensors

6f2b082

Signed-off-by: Kacper Pietkun <[email protected]>

Merge branch 'main' into dev/kpietkun/test_custom_op_correctness_clean

7094c57

michalkuligowski approved these changes Nov 4, 2025

View reviewed changes

Merge branch 'main' into dev/kpietkun/test_custom_op_correctness_clean

c8b6da0

Merge branch 'main' into dev/kpietkun/test_custom_op_correctness_clean

e0d85fc

Copilot AI review requested due to automatic review settings November 7, 2025 09:35

Copilot AI reviewed Nov 7, 2025

View reviewed changes

Kacper-Pietkun and others added 2 commits November 7, 2025 10:41

Update tests/unit_tests/ops/utils.py

b653548

Co-authored-by: Copilot <[email protected]> Signed-off-by: Kacper Pietkun <[email protected]>

Update tests/unit_tests/ops/utils.py

5656ef2

Co-authored-by: Copilot <[email protected]> Signed-off-by: Kacper Pietkun <[email protected]>

Kacper-Pietkun and others added 3 commits November 7, 2025 10:42

Update tests/unit_tests/ops/utils.py

146d3a4

Co-authored-by: Copilot <[email protected]> Signed-off-by: Kacper Pietkun <[email protected]>

Update tests/unit_tests/ops/test_hpu_rotary_embedding.py

17d4a0d

Co-authored-by: Copilot <[email protected]> Signed-off-by: Kacper Pietkun <[email protected]>

Update tests/unit_tests/ops/test_hpu_rotary_embedding.py

fc92b72

Co-authored-by: Copilot <[email protected]> Signed-off-by: Kacper Pietkun <[email protected]>

michalkuligowski merged commit 0a6113b into vllm-project:main Nov 7, 2025
37 checks passed

Add tests for custom operator implementation correctness #457

Add tests for custom operator implementation correctness #457

Uh oh!

Conversation

Kacper-Pietkun commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Kacper-Pietkun commented Oct 23, 2025

Uh oh!

Kacper-Pietkun commented Oct 23, 2025

Uh oh!

github-actions bot commented Oct 23, 2025

✅ CI Passed

Uh oh!

Kacper-Pietkun commented Oct 27, 2025

Uh oh!

github-actions bot commented Oct 27, 2025

✅ CI Passed

Uh oh!

github-actions bot commented Oct 27, 2025

🚧 CI Blocked

Uh oh!

Kacper-Pietkun commented Oct 27, 2025

Uh oh!

github-actions bot commented Oct 27, 2025

✅ CI Passed

Uh oh!

github-actions bot commented Nov 4, 2025

✅ CI Passed

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Kacper-Pietkun commented Nov 7, 2025

Uh oh!

github-actions bot commented Nov 7, 2025

✅ CI Passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Kacper-Pietkun commented Oct 23, 2025 •

edited

Loading